On Hidden Markov Model Maximum Negentropy Beamforming

نویسندگان

  • Barbara Rauch
  • Kenichi Kumatani
  • Friedrich Faubel
  • John McDonough
  • Dietrich Klakow
چکیده

In prior work, we developed a beamforming algorithm intended for automatic recognition of speech data captured with an array of distant microphones. In addition to enforcing a distortionless contraint in a desired direction, we adjusted the sensor weights so as to maximimize a negentropy criterion. Negentropy is a measure of how non-Gaussian the probability density function (pdf) of a random variable is. It is known that subband samples of speech are highly non-Gaussian, but become more Gaussian when corrupted with noise or reverberation. Here we extend our prior algorithm by using an auxiliary hidden Markov model to model the nonstationarity of speech during beamforming. In a set of far-field ASR experiments on data from the Multi-Channel Wall Street Journal Audio-Visual Corpus, we were able to reduce the word error rate from 14.6% to 13.6% by accounting for this non-stationarity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling the nonstationarity of speech in the maximum negentropy beamformer

State-of-the-art automatic speech recognition (ASR) systems can achieve very low word error rates (WERs) of below 5% on data recorded with headsets. However, in many situations such as ASR at meetings or in the car, far field microphones on the table, walls or devices such as laptops are preferable to microphones that have to be worn close to the user’s mouths. Unfortunately, the distance betwe...

متن کامل

Maximum Negentropy Beamforming

In this paper, we address an adaptive beamforming application based on the capture of far-field speech data from a single speaker in a real meeting room. After the position of a speaker is estimated by a speaker tracking system, we construct a subband-domain beamformer in generalized sidelobe canceller (GSC) configuration. In contrast to conventional practice, we then optimize the active weight...

متن کامل

Towards Online Maximum Kurtosis Beamforming

In prior work, the current authors investigated the use of optimization criteria for beamforming that exploit the non-Gaussianity of human speech. In particular, we examined beamforming algorithms designed to maximize the kurtosis or negentropy of the subband output of a generalized sidelobe canceller. These techniques, while effective, require making multiple passes through the data, and hence...

متن کامل

Speech enhancement based on hidden Markov model using sparse code shrinkage

This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...

متن کامل

Introducing Busy Customer Portfolio Using Hidden Markov Model

Due to the effective role of Markov models in customer relationship management (CRM), there is a lack of comprehensive literature review which contains all related literatures. In this paper the focus is on academic databases to find all the articles that had been published in 2011 and earlier. One hundred articles were identified and reviewed to find direct relevance for applying Markov models...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008